Automatic Modelling of Rhythm and Intonation for Language Identification
نویسندگان
چکیده
This paper deals with an approach to Automatic Language Identification using only prosodic modeling. The traditional approach for language identification focuses mainly on phonotactics because it gives the best results. Recent studies reveal that humans use different levels of perception to identify a language, in particular prosodic cues. Among prosodic features, rhythm is known to carry a substantial information about language identity. Rhythm is produced by the periodicity of a pattern that can be syllable, which is a language specific unit. That is why we introduced the notion of ”Pseudo-Syllable”, derived from the most frequent syllable structure in the world, the Consonant-Vowel structure. In this paper, an automatic and language independent rhythmic units extraction algorithm is described: using a vowel detection algorithm, rhythmic units matching the Consonant-Vowel structure are segmented. Two models describing rhythm and intonation of each language are then learned using Gaussian Mixtures.
منابع مشابه
Language identification with suprasegmental cues: a study based on speech resynthesis.
This paper proposes a new experimental paradigm to explore the discriminability of languages, a question which is crucial to the child born in a bilingual environment. This paradigm employs the speech resynthesis technique, enabling the experimenter to preserve or degrade acoustic cues such as phonotactics, syllabic rhythm, or intonation from natural utterances. English and Japanese sentences w...
متن کاملRhythmic unit extraction and modelling for automatic language identification
This paper deals with an approach to automatic language identification based on rhythmic modelling. Beside phonetics and phonotactics, rhythm is actually one of the most promising features to be considered for language identification, even if its extraction and modelling are not a straightforward issue. Actually, one of the main problems to address is what to model. In this paper, an algorithm ...
متن کاملComputer Assisted Pronunciation Teaching (CAPT) and Pedagogy: Improving EFL learners’ Pronunciation Using Clear Pronunciation 2 Software
This study examined the impact of Clear Pronunciation 2 software on teaching English suprasegmental features, focusing on stress, rhythm and intonation. In particular, the software covers five topics in relation to suprasegmental features including consonant cluster, word stress, connected speech, sentence stress and intonation. Seven Iranian EFL learners participated in this study. The study l...
متن کاملUsing prosody and phonotactics in Arabic dialect identification
While Modern Standard Arabic is the formal spoken and written language of the Arab world, dialects are the major communication mode for everyday life; identifying a speaker’s dialect is thus critical to speech processing tasks such as automatic speech recognition, as well as speaker identification. We examine the role of prosodic features (intonation and rhythm) across four Arabic dialects: Gul...
متن کاملA computer-assisted prosody pronunciation teaching system
Work in the last decade shows, that Computer-Assisted Pronunciation Teaching (CAPT) systems are useful, flexible tools for giving pronunciation instructions and evaluating at subject’s speech. This paper describes a newly developed CAPT system that intends to address appropriate teaching of such supra-segmental parameters as intonation, stress and speech rhythm. Two modules are implemented: (1)...
متن کامل